Developing a voiced information retrieval system for the Portuguese language capable to handle both Brazilian and Portuguese spoken versions
Abstract
The two versions of the Portuguese language, the one spoken in Portugal and the one spoken in Brazil, differ both phonetically and lexically. The paper reports on a menu-driven voiced information retrieval system, BomdePapo, which can handle both versions of the language. The system was developed in Delphi during a research project joining teams from the Universidade do Porto, Portugal, and the Universidade Federal do Rio de Janeiro, Brazil. Information retrieval proceeds through a tree menu. BomdePapo is currently being tested on an application for accessing sports, culture, and politics news. The paper also reports on the main phonetic differences between the two versions.
Similar resources
‘Minor’ Languages, ‘Broken’ Translations: On Brazilian Reworkings of an Albanian Novel
This essay approaches the challenges of global translation in the 21st century from what might still be considered a somewhat uncommon example: a direct translation of Ismail Kadaré's 1978 novel Prill e thyër (Broken April) from the original Albanian into Brazilian Portuguese in 2001. Not only does it examine and compare lexical elements in the source and target texts and the usage of translato...
The Presence and Influence of English in the Portuguese Financial Media
As the lingua franca of the 21st century, English has become the main language for intercultural communication for those wanting to embrace globalization. In Portugal, it is the second language of most public and private domains influencing its culture and discourses. Language contact situations transform languages by the incorporations they make from other languages and Portugal has...
A Rule Based Pronunciation Generator and Regional Accent Databank for Portuguese
One of the major obstacles in deploying spoken language technologies (SLTs) in the developing world is a lack of key linguistic resources – e.g. electronic dictionaries, phonetically aligned corpora, pronunciation lexicons, etc. – that describe the non-dominant varieties spoken in such countries and regions. In this paper, we describe the work of the LUPo (Portuguese Unisyn Lexicon) project to ...
Prosody, syntax, and pragmatics: insubordination in spoken Brazilian Portuguese
In this paper, we approach the phenomenon of insubordination in spoken Brazilian Portuguese through data on adverbial clauses extracted from the C-ORAL-BRASIL corpus [Raso and Mello 2012]. Differently from the traditional conception of insubordination [Evans 2007], we propose a synchronic view, strongly based on the analysis of the prosody/pragmatics interface in spoken language. We show that i...
Language and variety verification on broadcast news for Portuguese
This paper describes a language/accent verification system for Portuguese that explores different types of properties: acoustic, phonotactic and prosodic. The two-stage system is designed to be used as a pre-processing module for the Portuguese Automatic Speech Recognition (ASR) system developed at INESC-ID. As the ASR system is applied every day to transcribe the evening news from a Portuguese ...